Comparing spectral densities of stationary time series with 

unequal sample sizes 



Abstract 

This paper deals with the comparison of several stationary processes with unequal sample sizes. We provide a 
detailed theoretical framework on the testing problem for equality of spectral densities in the bivariate case, after 
which the generalization of our approach to the m dimensional case and to other statistical applications (like testing 
for zero correlation or clustering of time series data with different length) is straightforward. We prove asymptotic 
normality of an appropriately standardized version of the test statistic both under the null and the alternative and 
investigate the finite sample properties of our method in a simulation study. Furthermore we apply our approach to 
cluster financial time series data with different sample length. 

AMS subject classification: 62M10, 62M15, 62G10 

Keywords and phrases: spectral density, integrated periodogram, cluster analysis, time series, stationary process, unequal 
length 

1 Introduction 

The comparison and clustering of different time series is an important topic in statistical data analysis and has various 
applications in fields like economics, marketing, medicine and physics, among many others. Examples are the grouping 
of stocks in several categories for portfolio selection in finance or the identification of similar birth and death rates 
in population studies. One approach to identify similarities or dissimilarities between two stationary processes is to 
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compare the spectral densities of both time series, which directly yields to the testing problem for equality of spectral 
densities in multivariate time series data. This problem has found considerable interest in the literature [see for example 



Jenkins (1961) or De Souza and Thomson (1982) for some early results], but in the nonparametric situation nearly all 



proposed procedures are only reasoned by simulation studys or heuristic proofs, see Coates and Diggle (1986), Potscher 



and Reschenhofer ( 1988 1 , Diggle and Fisher ( 1991 ) and Maharaj ( 2002 1 among many others. Most recently Eichler ( 2008 ) , 



Dette and Paparoditis (2009), Dette et al. (2011), Jentsch and Pauly (2011) and Dette and Hildebrandt (2011) provided 



mathematical details for the above testing problem using different L2-type statistics, but nevertheless in all mentioned 
articles it is always required that the different time series have the same length, which is typically not the case in practice. 



Caiado and Pena ( 2009 ) considered different metrics for the comparison of time series with unequal sample sizes in a 



simulation study and Jentsch and Pauly (2012) provided a theoretical result, which however does not yield a consistent 



test as it was also pointed out by the authors. 



This paper generalizes the approach of Dette et al. (2011 ) to the case of unequal sample sizes and yields a consistent test 



for the equalness of spectral densities for time series with different length. Although the limiting distribution will be the 



same as m 



Dette et al. (2011 ) note that our proof is completely different. This is due to the fact that one essential part in 



the proofs of Dette et al. (2011 1 is that the different processes have the same Fourier coefHcents which is not given if the 



observed time series have different sample sizes. For the sake of brevity we will focus on the case of two (not necessarily 
independent) stationary processes, but the results can be easily extended to the case of an m dimensional process. 
Our aim throughout this paper is to estimate the L2-distance D'^ := /^^(/ii(A) — /22(A))^(iA, where /ii(A) and /22(A) 
are the spectral densities of the first and the second process respectively. Under the null hypothesis 



Ho: /11(A) = /22(A) 



(1.1) 



the distance D"^ equals zero while it is strictly positive if /11(A) 7^ /22(A) for A G A, where A is a subset of [— 7r,7r] 
with positive Lebesgue measure. We will estimate by sums of the (squared) periodogram, where the sum goes over 
the Fourier coefhcents of the smaller time series. Asymptotic normality both under the null and the alternative will be 
derived and since the variance terms can be easily estimated also under the alternative, asymptotic confidence intervalls 



and a precise hypothesis test can be constructed next to the test for (1.1) [see Remark 2.2 . Furthermore our approach 



has much wider application like testing for zero correlation, discriminant analysis or clustering of time series with unequal 
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length [see Remark 2.3-2.4 , and a simulation study will indicate that some of our assumptions are in fact not necessary 
(for example our method seems to work also for Long Memory processes). 



2 The test statistic 



Let ni,n2 & IN with ni < 712 and consider the two stationary time series 



(2.1) 



where the z[^^ are independent and identically standard normal distributed for j — 1,2 and 



p if t2 = [hqm.nA - Vini, 



else 



IJ 



(2.2) 



where qm. 



and p G [0, 1]. This roughly speaking means that changes in the time series with less observations 



influence the more frequently observed series but not vice versa, which is for example the case if interest rates and stock 

returns are compared. Throughout the paper we also assume that the technical condition 

00 



< 00 



(2.3) 



is satisfied for an a > 1/2 (j = 1,2). Note that the assumption of Gaussianity is only imposed to simplify technical 
arguments [see Remark |2.5| . Furthermore innovations with variances different to 1 can be included by choosing other 
coefficents i/'P''- We define the spectral densities fij{\) = 1,2) through 



1 

2^ 



V'F^exp(-iAO 



/i2W:=f E V'PVS^exp(-zA(;-m)) = /21(A). 



/,m— — 00 



An unbiased (but not consistent) estimator for fjj{X) is given by the periodogram 



2iTn 



1 , 

—\yx, 



exp(-iAO 



(2.4) 



and although the periodogram does not estimate the spectral density consistently, a Riemann-sum over the Fourier 
coefficents of an exponentiated periodogram is (up to a constant) a consistent estimator for the corresponding integral 



over the exponentiated spectral density. For example. Theorem 2.1 in Dette et al. (2011) yields 



Kn. := - E A'(Ai,fc) 



p ^ 



(2.5) 



where Ai fe 



27rfe 



(k = 1, .... [^J) are the Fourier coefficents of the smaUer time series X^^\ If we can show that 



p J_ 
^ 2tt 



1 ^"^^ , 

^ fc=i 



ni 



fc=i 



An 



/ii(A)/22(A)dA=:Di2, 



(2.6) 
(2.7) 



we can construct an consistent estimator for through Z)^^ :— i(£)i „j + D2,ni) ~ 2Z)i2^„j. Although (2.6) looks very 



much like (2.5), note that the convergence in (2.6 1 is different since the coefhcents Ai.fe are not necessarily the Fourier 



(2) J I 

coefHcents of the time series . This implies that the proof of (2.6) has to be done in a completely different way than 



the proof of (2.5 1 in Dette et al. (2011). We now obtain the following main theorem. 



Theorem 2.1 ///ii(A), /22(A) and /12(A) are Holder continuous of order L > 1/2 and 



ri2 

Hi 



Q 



for a Q £ IR, then as ni 00 



(2.8) 



v/^(Di,„, -Di,Di2,„, -i?i2,D2,„, -D2) ^ iV(0, (E„)^_^.^ J 



with 
Sii = 5 

^23 



r A\(A)dA, E12 - r ./fi(A)/22(A)rfA+ r fhimi2w\', 

J —TT J —IT J —TV 
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/l2(A)/2l(A)+4 / /ll(A)|/i2(A)|V22(A), E22 



/A(A)/2^2(A)dA 



/ll(A)|/l2(A)|2/22(A)dA 



fn{X)fl,{X)dX+ /2'2(A)|/i2(A)P, 233 = 5/ /2*2(A)rfA 



Although condition (2.8) imposes some restrictions on the growth rate of ni and 712, it is not very restrictive, since 



in practice there usually occur situations where even 712 = Qni holds for a Q e W (if for example daily data are 
compared with monthly data) and on the other hand this condition needs only to be satisfied in the limit. From Theorem 



2.1 



D 



it now follows by a straightforward application of the Delta-Method that y^(-D„^ — D ) > N{0,a ), where 



^2 i I Eii±S33 + 45^22 + ^ - 2Si2 - 2E23}, which becomes a^^ = ^ /A(A)dA + ^ /^^ |/i2|^dA under Hq. To 
obtain a consistent estimator for the variance under the null hypothesis we define 

/12(A) : = ^pl^ exp(-iApi) J2 4f exp(zAp2) - MA) 

27riVnin2 '^^ 

^ Pl = l P2 = l 



and analogous to the proof of Theorem |2.1| it can be shown that 

L¥J , L"i/2j-i 

4ni 



-, L^J L"i/2j-i 
^ fc=l ^ fc=l 



Therefore an asymptotic niveau-a-test for (1.1) is given by: reject (1.1) if 



1)2 

"1 



(2.9) 



where Mi_q, denotes the (1 — a) quantile of the standard normal distribution. This test has asymptotic power 
$ ( — ^~J^° ui-a I where $ is the distribution function of the standard normal distribution. This yields that 



the test (2.9) has asymptotic power one for all alternatives with D > 0. 



Remark 2.2 

It is straightforward to construct an estimator a^, which converges to the variance also under the alternative. This 
enables us to construct asymptotic (1 — a) confidence intervals for D^. The same statement holds, if we consider the 
normalized measure i?2 ^^g^^ ,which can be estimated by i?„ 



-Di,„i+-D2,„i 

From Theorem |2.1| and a straightforward application of the delta method, it follows that 



2 ^.2^ D^N{0,af), 



(2.10) 



where can be easily calculated. By considering a consistent estimator af for (which can be constructed through 



the corresponding Riemann sums of the periodogram) , (2.10) provides an asymptotic level a test for the so called precise 
hypothesis 

Ha : R'^ > e versus Hi : < e , (2.11) 



where e > [see Berger and Delampady (1987)]. This hypothesis is of interest, because spectral densities of time series 
in real-world applications are usually never exactly equal and a more realistic question is then to ask, if the processes 



have approximately the same spectral measure. An asymptotic level a test for (2.11) is obtained by rejecting the null 
hypothesis, whenever R^^ — e < -^^Ua- 

Remark 2.3 

Theorem |2 . 1 1 can also be employed for a cluster and a discriminant analysis of time series data with different length, since 
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it yields an estimator for the distance measure /22)i where 



dif,g) = 1-^ 



2/:./(A)5(A)dA 



1/2 



which can take values between and 1. A value close to indicates some kind of similarities between two processes, 
whereas a value close to 1 exhibits dissimilarities in the second-order structure. The distance measure d(/ii,/22) can be 
estimated by 



"12 = max 1 X — , 

V V ^l,ni + D2,ni I 



1/2 



(2.12) 



2X) 

where the maximum is necessary, because the term 1 — -g 1^ — might be negative. 



Remark 2.4 



The main ideas of the proof of Theorem 2.1 can be furthermore employed to construct tests for various other hypoth- 
esis. For example a test for zero correlation can be derived by testing for /12 = which can be done by estimating 
XT^ |/i2(A)p(iA. An estimator for this quantity is easily derived using the above approach and furthermore the calcula- 
tion of the variance is straightforward, which we omit for the sake of brevity. 



Remark 2.5 

Although we only considered the bivariate case, our method can be easily extended to an m dimensional process. Moreover, 
a cumbersome but straightforward examination yields that our test also has asymptotic level a, if we skip the assumption 
of Gaussianity since (under the null hypothesis) all terms which consist the fourth cumulants of the processes Z*^*) cancel 



out. A similar phenomenon can be observed for the tests proposed by Eichler (2008), Dette et al. (2011), Dette and 



Hildebrandt (2011) and Dette et al. (2011). 



3 Finite sample study 



3.1 Size and power of the test 



In this section we study the size and the power of test (2.9) in the case of finite samples. All simulations are based on 



1000 iterations and we consider all different combinations of ni,n2 E {256,384,512,640} with rii < n2- For the sake 
of brevity we only present the results for the case p = and note that the rejection frequencies do not change at all 



if we consider correlations different to zero. We furthermore tested our approach using non-hnear GARCH models and 
obtained a very good performance also in this case. The results are not displayed for the sake of brevity but are available 
from the authors upon request. To demonstrate the approximation of the nominal level, we consider the five processes 

X^:Xt^Zt, : Xt = -0.8Xt-i + Zt, : = - 0.8Zt_i, X"* : ~ i^Ai?/MA(0.45, 0, 0.8), 

: — Ztl[t<o.5T] + 0-8Xt^il[o,5T<t<0.75T] + Ztl[t>o.75T] for t = 1, T, 

where the i^yli?/Af A(0.45, 0, 0.8)-model corresponds to a LongMemory-process given by (1 — B)'^-^^Xt = (1 — 0.8B)Zt 
with the backshift-operator B (i.e. B^ Xt — Xt^j). Note that the models X"' and X''' both do not fit into the theoretical 
framework considered in section 2, since for the FARIMA{QAb, 0, 0.8)-process we obtain Yll^-oo = oo which contra- 
dicts (2.3 1 and the structural-break model X^ does not even has a stationary solution. Nevertheless since these models 
are of great interest in practice, we investigate the performance of our approach in these cases as well. The results are 
given m Table [l] and it can be seen that the test (|2.9[) is very robust against different choices of rii and n2- Furthermore 



our method also seems to work for the models X^ and X^ although the convergence is slightly slower. 

To study the power of the test we additionally present the results of a comparison of X^ with X^ for j G {1, 2} and X"'^ 

with X^ (all other comparison between the processes yield better results than the depicted ones). 

3.2 Real world data 



In this section we investigate how the clustering-method described in Remark 2.3 performs, if it is applied to real world 
data. Therefore we took three log-returns of stock prices from the financial sector, three log-returns from the health 
sector and two key interest rates. Exemplarily for the finance sector we choosed the stocks of Barclays, Deutsche Bank 
and Goldman Sachs and the health sector is represented by GlaxoSmithKline, Novartis and Pfizer. The key interest rates 
were taken from Great Britain and the EU and all time series data were recorded between March 1st, 2003 and July 29th, 
2011. While the interest rates data were observed monthly, the stock prices were recorded daily or weekly . However, 
even if two stock prices were observed daily they might differ in length, since they are for example traded on different 



stock exchanges with different trading days. The result of our cluster analysis using (2.12) is presented in the dendrogram 
given in Figure [l] We get three different groups which correspond to the finance sector, the health sector and the key 
interes rates. 
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4 Appendix: Technical details 



Proof of Theorem 2.1 By using the Cramer- Wold device, we have to show that 

for all vectors c G . For the sake of brevity, we restrict ourselve to the case c = (0, 1, 0)"^ since the more general follows 
with exactly the same arguments. Therefore we show Tm '■= \AiI(^i2,rii — ^12) ^ > -^(0, S22) and we do that by using 



the method of cumulants, which is described in chapter 2.3. of Brillinger ( 1981 1 (and whose notations we will make heavy 
use of), i.e. in the following it is proved that 

cum,(f„J = 0(1) for ; = 1 and ; > 3, (4.1) 
cum2(f„J ^^i^ E22, (4.2) 

which will yield the assertion. 



Proof of (4.1 1 for the case I = \: Because of the symmetry of the periodogram, it is 



and because of the standard normality of the innovations we obtain 

10^7(1) 7(1) 7(2) 7(2) ^ . (1) (1) ^,10/7(2) 7(2) ^ . (1) (2) ^,70/7(1) 7(2) ^ 

^^'^Pl-il^gi-'ni^P2-/2^92-m2i -^y'^pi-h'^qi-rni)H'^p:,-l2^q2-ni2) +H^pi-h^q2-rn2)H'^qi-rni^P2~lJ 

+ '^i^l]-h^pl-l2)'^i^il-r>ii^il-ni2) 

which yields that 1E(-Di2,ni) (without the 0(l/ni)-term) can be divided into the sums of three terms which are called A, 
B and C respectively. For the first term we obtain the conditions Pi — Qi + h ~ "ii,p2 = 92 + ^2 ~ "^2 (a-U others cases 
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are equal to zero). This results in 

-. L^J , oo m n2 

'-k ^ Y. ^ Y. 

I "1~"^ J li,l2,'mi,m2= — co 91=1 ?2 = 1 

^ l<iji+/i— mi<ni l<g2+'2— ™2<n2 



g-'Ai,fc(/i-mi)-iAi j.+i(i2-ni2) 



^ L 2^ J -, "1 '12 oo / 1 \ 

_ V V V V 7//^^ /(2) -iAi,fc(ii-mi)-jAi.fe+i(/2-m2) , f _L \ 

fc— I gi = l ij2 = l ii,'2,nii,"i2=— oo 



where the last equality follows from 



;:|;|<Mnj ^ !:|!|<Mnj 



with M a M, where ( 2.3 ) was used. It now follows by the Holder continuity condition that A equals f^^ /ii ( A)/22 (A)(iA+ 
o (^^^^ ■ If consider the summand B, we obtain the conditions qi = \_{p2 — ?2)9ni,n2j + "^i ~ L'i'ni,"2 ^ Ij i 
92 = L(Pi - h)qni.n2\ +m2~ L9ni,"2 ^ IJ which yields 



p2 1 



2ni ^-^ (27r)2ni?i2 

/,— I "1-1 ii,mi,i2,m2=— oo Pi = l P2 = l 



1< L(Pl-'l)9ni ,"2 J+™2- L9ni,n2 -"2 1 < L (P2 - ' 2 ) j , ^2 J +™ 1 - ["Jn 1 , "2 ~ ^ J -"l 

„,, , ,,-'-''l.f!(Pl-|(P2-'2)gn,,n, \+mi-\q 



_^(^)g-'-^l.fc(Pl-L(P2-;2)g„i,„2J+™l-L9>M."2-lJ)e~*-^l'''+l(P2-L(Pl-'l)«»ii,»i2J+™2-L9Tii,Ti2-lJ) 



If we now employ the identity 



L^J 1 if p = 0,±ni,±2ni,... 

E e-'^i-'-P = < , (4.4) 



"■1 _ „i-i I 

'^^"L^'s— J else 



it follows with (2.8) that if pi is chosen there are only finitely many p2 which yields a non-zero summand. Therefore we 



obtain that B — o{l/y^nl) and with the same arguments it can be shown that C = 0(1/^^77^). □ 



Proof of (4.21: It is 



cum2(v/m-Di2,«i) = — cum2(/i(Ai,fe)/2(Ai,fe+i)) H cum(/i(Ai,feJ/2(Ai,fei+i), /i(Ai,fc2)/2(Ai,fe2+i)) 

n 1 ^ — ^ rj 1 ^ — ^ 



Tlx — ' Til 

k—\ k\^h2—\ 

k\=^k2 

and the assertion follows if we show that 



- E cum2(/i(Ai,,)/2(Ai,fc+i)) ^^i^ A r /2,(A)/|2(A)dA, (4.5) 
- V cum(/i(Ai,feJ/2(Ai,fc,+i),/i(Ai,,2)/2(Ai.fc2+i)) r /ii(A)|/i2(A)P/22(A)dA. (4.6) 

7 1 — 7r 



fc=l 



'^1 i. i 1 
fci ,fc2 = l 

k\^k2 



We present a detailed proof of (4.51 and then comment briefly on (4.6) since it is proved analogously. Employing the 



symmetry of the periodogram again, we get 

— ^ cum2(/i(Ai^fe)/2(Ai,fe+i)) = - — ^ cum2(/i(Ai,fe)/2(Ai,fe+i)) + 0(l/ni) 

Til J T Zili 

g-«Ai,,(pi-qi+ri-si)-»Ai,,+i(p2-92+r2-s2)cuni(zf/^ „ Z^^^ , Z^^' „ Z^^^ , , Z^^^ „ Z^^^ , zf,^^ , Z^^^ , ) + 0(l/ni) 

V Pi — ai iji — bi P2— 02 92—02' ri—ci si—di r2— C2 S2 — a2-' V / 

, L^J 2 rij oo 

g-^Ai,,(pi-9i+n-si)-*Ai,,+i(p2-92+r2-s2)cuin(Zp^(i,j) e i/i)---cum(Zp^;(i,j) G 1^4) + 0(l/ni) 

where the sum goes over all indecomposable partitions = i^i U ... U 1^4 of 

7(1) ^(1) ^(1) ^(1) 

^pi-ai qi-bi P2-a2 92-^2 

7(2) 7(2) 7(2) 7(2) 

^ri-ci ^si-di ^r2-C2 ^53-^2 

with jj/il = 2 V« = 1, ...,4 (we only have to consider partitions with two elements in each set, because of the Gaussianity 
of the innovations; in the non-Gaussian case we would get an additional term containing the fourth cumulant). Every 
chosen partition will imply conditions for the choice of pj,qj,rj,Sj as in the calculation of the expectation. For some 
partitions there will not be a pj , qj , rj , Sj in the exponent of e after inserting the conditions and for other partitions 
there will still remain one. Let us take an example of the latter one and consider the partition which corresponds to 
cum(Zj[,^L^^,Z^^^LrfJcum(Z^^^Lf,^,^^iLcJcum(Z^f_„^,Z^^LcJcum(Z^^^^j,^,Z^^'_^J. We name the corresponding term of 



this partition in (4.7) with V2 and obtain the conditions pi = si + ai — di, qi = ri + 61 — ci, p2 = ^'2 + 02 — C2, 



92 = ^2 + ^^2 ~ ^2 which yields 



L^J 2 00 ni n2 

"-2^ E p^]^E E E E «' 

[ "i^~^ J ^ ' j = l aj ,bj ,Cj .dj=~oo si,ri = l i'2,S2 = l 



l<si +ai — di <ni I<r2+a2— C2<n2 
l<ri+6i— ci<ni l<S2+b2 — <i2<n2 



g-jAi_fc(ai-di+ci-6i)-iAi,fc-l-i(2r2-2s2+a2-C2+d2-ft2) 



^ (2^)4n2n2 ^ ^ J2 ^i'i^-^d2 

/,;— [ "1^~^ J j = l Oj ,6j ,Cj ,(ij= — 00 Si ,ri = l r2,S2 = l 

-iAi_fc + i(2r2-2s2)g-jAi,fc(ai-di+ci-bi)-jAi,fc + i(a2-C2+£i2-62) ^ j ^Jn\) 
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where the last equaUty again follows with (4.3 1. Now as in the handling of B in the calculation of the expectation, (4.4) 
implies that V2 — o(l). 

Every other indecomposable partition is treated in exactly the same way and there are only three partitions which 



corresponding term in (4.7) does not vanish in the limit. These partitions correspond to one of the following three terms: 



2) cum(4^L,^, zl^iL,Jcum(zJ^^^,^, 4^^L,Jcum(z(f_,^, zJ^2L,Jcum(4fL,^, ZI^L.J 

3) cum(z(^L,^, zl^iL,Jcum(zJ^^^,^, z(^^L,Jcum(z(f_,^, zl^L,Jcum(zJ^2L,^, 



We will exemplarily present the calculation concerning the 1) partition and denote the corresponding sum in (4.7) with 
Vi. We get that Vi equals 

L^J -. 2 00 ni 712 

I "1-1 ^ / J- ^ j = l aj ,bj ,Cj ,dj=—ca gi,si = l i'2,S2 = l 

^ l<(ji+ai— hi<ni l<S2+a2 — <i2<ra2 

l<si+ci — cii<ni I<r2+fe2— C2<"2 

L~i^J -.2 00 ni 712 

fc— ^'12 j = l aj ,bj ,Cj ,dj =—oa q-L,si = l r-2,S2 = l 

by using ( |4.3[ ). Now the Holder continuity condition implies Vi = /^^ /ii(A)^/22(A)^(iA + o{l/y/ni) and since the 



partitions 2) and 3) yield the same result, we have shown (4.5) 



With the same arguments as in the proof of (4.5 ) it can be seen that 



— ^ cum(/i(Ai,fcJ/2(Ai,fei+i),/i(Ai,fcJ/2(Ai,fc2+i)) 



"■1 I. ; 1 

/Ci,fc2=l 
fel#fc2 



1 

— ^ cum(/i(Ai,feJ/2(Ai^fci+i),/i(Ai,fei+i)/2(Ai,fe^+2)) 



^ fc,=i 



1 

- cum(Ji(Ai,fcJ/2(Ai,fe,+i),/i(Ai,fc,_i)/2(Ai,fcJ) +0(1) 



/ci — 1 



and it is shown completely analogously to the proof of (4.5) that 



Ecum(/i(Ai,feJ/2(Ai.fei+i),/i(Ai,fej+i)/2(Ai.fei+2)), — cum(/i(Ai,feJ/2(Ai,fci+i), /i(Ai,fc^_i)/2(Ai.feJ) 

n 1 ^ — ^ 



Til — ' n-i 

^ fc,=l ^ fe,=l 



both converge to ^ J^^ /ii(A)|/i2(A)|2/22(A)dA, which yields □ 
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Proof of (4.1 ) for the case I > 3: Since the proof is done by combining standard cumulants methods with the arguments 



that are used in the previous proof, we wiU restrict ourselve to a brief explanation of the main ideas. We obtain 



cum; (y^i^l2,„ J =7,^—17^ E E (0^^2l„l^l E E E ^ii^-'-V-f 



exp(-iAi,fe(pii - qix) - iAi,fc+i(pi2 - 912)) • • • exp(-iAi^A;(p;i - In) - i>^i,k+i{pi2 - qn)) 

riimrz^^) 7^^) 7(2) 7^^) 7^^) 7(2) 7(2) ^ 

V Pll-aii^qii-6ii'^Pi2-ai2 qi2-bl2' Pil-Qil ijii-bu Pi2-ai2 gi2-b!2'' 

and if we now take a indecomposable partition of 

7(1) 7(1) 7(2) ;7(2) 

•^pii-aii ^gii-611 '^pi2-ai2 gi2-fci2 

7(1) r,(l) y(2) „(2) 

Pil-au gii-ftll Pl2-ai2 qi2~bi2 

which consists only of sets with two elements (again this suffices because of the Gaussianity of the innovations), it 
follows directly that at most 21 of the 41 variables Pji.j2^1ji,j2 Ui = •■•i'j h — I72) are free to choose. By using the 
same arguments as in the calculation of the variance and the expectation it then follows by the indecomposability of the 
partition that in fact only /+1 of the remaining 21 variables Pji.j2, Qji,j2 fr'^e to choose. This implies cum/ ( y^rffZ?! 2, m) = 
0{n]^'^'^) which yields the assertion. □ 

References 

Berger, J. O. and Delampady, M. (1987). Testing precise hypotheses. Statistical Science, 2:317-335. 
Brillinger, D. R. (1981). Time Series: Data Analysis and Theory. McGraw Hill, New York. 

Caiado, J., C. N. and Pena, D. (2009). Comparison of times series with unequal length in the frequency domain. 
Communications in Statistics - Simulation and Computation, 38:527-540. 

Coates, D. and Diggle, P. (1986). Tests for comparing two estimated spectral densities. Journal of Time Series Analysis, 
7:7-20. 

De Souza, P. and Thomson, P. (1982). Lpc distance measures and statistical tests with particular reference to likelihood 
ratio. IEEE Transactions Accoustics Speech Signal Process, 30:304-315. 

12 



Dette, H. and Hildebrandt, T. (2011). A note on testing hypotheses for stationary processes in the frequency domain. 
Journal of Multivariate Analysis, 104:101-114. 

Dette, H., Kinsvater, T., and Vetter, M. (2011). Testing nonparametric hypotheses for stationary processes by estimating 
minimal distances. Journal of Time Series Analysis, 32:447-461. 

Dette, H. and Paparoditis, E. (2009). Bootstrapping frequency domain tests in multivariate time series with an application 
to comparing spectral densities. Journal of the Royal Statistical Society B, 71:831-8571. 

Dette, H., Preul3, P., and Vetter, M. (2011). A measure of stationarity in locally stationary processes with applications 
to testing. Journal of the American Statistical Association, 106(495):1113-1124. 

Diggle, P. and Fisher, N. (1991). Nonparametric comparison of cumulative periodograms. Journal of the Royal Statistical 
Society C, 40:423-434. 

Eichler, M. (2008). Testing nonparametric and semiparametric hypotheses in vector stationary processes. Journal of 
Multivariate Analysis, 99:968-1009. 

Jenkins, G. M. (1961). General considerations in the analysis of spectra. Technometrics, 8:133-166. 

Jentsch, C. and Pauly, M. (2011). Testing equality of spectral densities using randomization techniques. Preprint. 

Jentsch, C. and Pauly, M. (2012). A note on using periodogram-based distances for comparing spectral densities. Statistics 
& Probability Letters, 82(1):158-164. 

Maharaj, E. (2002). Comparison of non-stationary time series in the frequency domain. Computational Statistical Data 
Analysis, 40:131-141. 

Potscher, B. M. and Reschenhofer, E. (1988). Discriminating between two spectral densities in case of replicated obser- 
vations. Journal of Time Series Analysis, 9:221-224. 



13 



ni 


n2 


a 


x^ x^ 


X^X^ X^X^* 


X* X^ 


X^X^ 


256 


256 


0.05 
0.1 


0.041 0.039 0.045 
0.088 0.128 0.106 


0.773 0.628 
0.894 0.875 


0.053 0.053 
0.117 0.131 


0.523 
0.685 


256 


384 


0.05 
0.1 


0.049 0.049 0.034 
0.106 0.128 0.099 


0.758 0.619 
0.877 0.841 


0.047 0.054 
0.114 0.147 


0.551 
0.719 


256 


512 


0.05 
0.1 


0.043 0.047 0.026 
0.085 0.139 0.069 


0.776 0.650 
0.892 0.848 


0.045 0.031 
0.109 0.126 


0.539 
0.739 


256 


640 


0.05 
0.1 


0.050 0.044 0.030 
0.109 0.112 0.081 


0.777 0.636 
0.904 0.859 


0.037 0.046 
0.097 0.122 


0.563 
0.755 


384 


384 


0.05 
0.1 


0.036 0.047 0.037 
0.092 0.120 0.103 


0.920 0.804 
0.969 0.936 


0.054 0.043 
0.110 0.117 


0.693 
0.814 


384 


512 


0.05 
0.1 


0.048 0.039 0.045 
0.110 0.120 0.091 


0.895 0.828 
0.956 0.938 


0.061 0.065 
0.131 0.136 


0.699 
0.836 


384 


640 


0.05 
0.1 


0.037 0.045 0.046 
0.078 0.128 0.096 


0.917 0.788 
0.968 0.925 


0.050 0.062 
0.124 0.149 


0.702 
0.858 


512 


512 


0.05 
0.1 


0.037 0.034 0.048 
0.096 0.111 0.106 


0.975 0.877 
0.994 0.967 


0.044 0.027 
0.097 0.100 


0.800 
0.889 


512 


640 


0.05 
0.1 


0.037 0.054 0.046 
0.094 0.137 0.103 


0.971 0.890 
0.987 0.973 


0.055 0.061 
0.119 0.131 


0.811 
0.906 


640 


640 


0.05 
0.1 


0.044 0.035 0.037 
0.106 0.101 0.089 


0.993 0.959 
0.999 0.993 


0.045 0.043 
0.104 0.110 


0.906 
0.934 



Table 1: Rejection frequencies of the test (2.9) under the null hypothesis and several alternatives for p ~ 0. 
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Figure 1: Clustering of financial time series data. 
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